Evaluation of Two XML Storage Approaches for Scientific Metadata

نویسندگان

  • Scott Jensen
  • Devarshi Ghoshal
  • Beth Plale
چکیده

Scientific data are increasingly described by metadata based on detailed XML schemata that capture both general and domain-specific concepts about the underlying data. Metadata captured using detailed XML schemata tailored to specific scientific domains increases the potential for data reuse by providing the ability to discover data products described by detailed concepts. Since such metadata is captured as XML, one alternative for managing scientific metadata is to store and query the metadata using a native XML database. Our research shows that a hybrid XML-Relational structure such as is used in the XMC Cat metadata catalog outperforms a native XML database for storing and querying scientific metadata; and significantly outperforms the native XML database under a scaled workload of concurrent inserts and queries.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A System for Storing, Retrieving, Organizing and Managing Web Services Metadata Using Relational Database*

In this paper we present our system for efficient storage, update, and retrieval of web service metadata documents in relational database. Initially we investigate all of the existing approaches for efficient storage and retrieval of XML documents in relational database. As a result we selected a recently proposed structure-centered storage schema named DLN (Dynamic Level Numbering). As there i...

متن کامل

Milos: A Multimedia Content Management System

This paper describes the architecture of the MILOS Content Management System. MILOS supports the storage and content based retrieval of any XML document, as well as multimedia documents whose descriptions are provided by using heterogenous metadata models represented in XML. MILOS is flexible in the management of documents containing different types of data and content descriptions; it is effic...

متن کامل

Using Characteristics of Computational Science Metadata for Efficient XML Storage in a Relational Database

Computational science communities are generating an ever-increasing volume of data products which are tracked and cataloged using metadata communicated via one or more community XML schemas. We propose a hybrid approach to storing metadata in a relational database that draws heuristics from computational science discovery, and show that the approach outperforms the well known inlining approach....

متن کامل

بررسی واکنش موتورهای کاوش وب به پیشینه‌های فرادا‌ده‌ای مبتنی برروش ترکیبی داده‌های خرد و روش داده‌های پیوندی

The purpose of this research was to find out the reaction of Web Search Engines to Metadata records created based on the combined method of Rich Snippets and Linked Data. 200 metadata records in two groups (100 records as the control group with the normal structure and, 100 records created based on microdata and implemented in RDF/XML as experimental group) extracted from the information gatewa...

متن کامل

memasysco: XML schema based metadata management system for speech corpora

The metadata management system for speech corpora “memasysco” has been developed at the Institut für Deutsche Sprache (IDS) and is applied for the first time to document the speech corpus “German Today”. memasysco is based on a data model for the documentation of speech corpora and contains two generic XML schemas that drive data capture, XML native database storage, dynamic publishing, and inf...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011